Search CORE

259 research outputs found

Generative models for natural images

Author: Ahmed Faruk
Publication venue
Publication date: 01/08/2017
Field of study

Nous traitons de modèles génératifs construits avec des réseaux de neurones dans le contexte de la modélisation d’images. De nos jours, trois types de modèles sont particulièrement prédominants: les modèles à variables latentes, tel que l’auto-encodeur variationnel (VAE), les modèles autorégressifs, tel que le réseau de neurones récurrent pixel (PixelRNN), et les modèles génératifs antagonistes (GANs), qui sont des modèles à transformation de bruit entrainés à l’aide d’un adversaire. Cette thèse traite de chacun de ces modèles. Le premier chapitre couvre la base des modèles génératifs, ainsi que les réseaux de neurones pro- fonds, qui constituent la technologie principalement utilisée à l’heure actuelle pour l’implémentation de modèles statistiques puissants. Dans le deuxième chapitre, nous implémentons un auto-encodeur variationnel avec un décodeur auto-régressif. Cela permet de se libérer de l’hypothèse d’indépendance des dimensions de sortie du décodeur variationnel, en modélisant une distribution jointe traçable à la place, et de doter le modèle auto-régressif d’un code latent. De plus, notre implémentation a un coût computationnel significativement réduit, si on le compare à un modèle purement auto-régressif ayant les mêmes hypothèses de modélisation et la même performance. Nous décrivons l’espace latent de façon hiérarchique, et montrons de manière qualitative la décomposition sémantique des causes latente induites par ce design. Finalement, nous présentons des résultats obtenus avec des jeux de données standards et démontrant que la performance de notre implémentation est fortement compétitive. Dans le troisième chapitre, nous présentons une procédure d’entrainement améliorée pour une variante récente de modèles génératifs antagoniste. Le «Wasserstein GAN» minimise la distance, mesurée avec la métrique de Wasserstein, entre la distribution réelle et celle générée par le modèle, ce qui le rend plus facile à entrainer qu’un GAN avec un objectif minimax. Cependant, en fonction des paramètres, il présente toujours des cas d’échecs avec certain modes d’entrainement. Nous avons découvert que le coupable est le coupage des poids, et nous le remplaçons par une pénalité sur la norme des gradients. Ceci améliore et stabilise l’entrainement, et ce sur différents types du paramètres (incluant des modèles de langue sur des données discrètes), et permet de générer des échantillons de haute qualités sur CIFAR-10 et LSUN bedrooms. Finalement, dans le quatrième chapitre, nous considérons l’usage de modèles génératifs modernes comme modèles de normalité dans un cadre de détection hors-distribution «zero-shot». Nous avons évalué certains des modèles précédemment présentés dans la thèse, et avons trouvé que les VAEs sont les plus prometteurs, bien que leurs performances laissent encore un large place à l’amélioration. Cette partie de la thèse constitue un travail en cours. Nous concluons en répétant l’importance des modèles génératifs dans le développement de l’intelligence artificielle et mentionnons quelques défis futurs.We discuss modern generative modelling of natural images based on neural networks. Three varieties of such models are particularly predominant at the time of writing: latent variable models such as variational autoencoders (VAE), autoregressive models such as pixel recurrent neural networks (PixelRNN), and generative adversarial networks (GAN), which are noise-transformation models trained with an adversary. This thesis touches on all three kinds. The first chapter covers background on generative models, along with relevant discussions about deep neural networks, which are currently the dominant technology for implementing powerful statistical models. In the second chapter, we implement variational autoencoders with autoregressive decoders. This removes the strong assumption of output dimensions being conditionally independent in variational autoencoders, instead tractably modelling a joint distribution, while also endowing autoregressive models with a latent code. Additionally, this model has significantly reduced computational cost compared to that of a purely autoregressive model with similar modelling assumptions and performance. We express the latent space as a hierarchy, and qualitatively demonstrate the semantic decomposition of latent causes induced by this design. Finally, we present results on standard datasets that demonstrate strongly competitive performance. In the third chapter, we present an improved training procedure for a recent variant on generative adversarial networks. Wasserstein GANs minimize the Earth-Mover’s distance between the real and generated distributions and have been shown to be much easier to train than with the standard minimax objective of GANs. However, they still exhibit some failure modes in training for some settings. We identify weight clipping as a culprit and replace it with a penalty on the gradient norm. This improves training further, and we demonstrate stability on a wide variety of settings (including language models over discrete data), and samples of high quality on the CIFAR-10 and LSUN bedrooms datasets. Finally, in the fourth chapter, we present work in development, where we consider the use of modern generative models as normality models in a zero-shot out-of-distribution detection setting. We evaluate some of the models we have discussed previously in the thesis, and find that VAEs are the most promising, although their overall performance leaves a lot of room for improvement. We conclude by reiterating the significance of generative modelling in the development of artificial intelligence, and mention some of the challenges ahead

Dépôt Institutionnel Numérique

Ambient awareness on a sidewalk for visually impaired

Author: Ahmed Faruk
Publication venue: University of Memphis Digital Commons
Publication date: 01/01/2019
Field of study

Safe navigation by avoiding obstacles is vital for visually impaired while walking on a sidewalk. There are both static and dynamic obstacles to avoid. Detection, monitoring, and estimating the threat posed by obstacles remain challenging. Also, it is imperative that the design of the system must be energy efficient and low cost. An additional challenge in designing an interactive system capable of providing useful feedback is to minimize users\u27 cognitive load. We started the development of the prototype system through classifying obstacles and providing feedback. To overcome the limitations of the classification-based system, we adopted the image annotation framework in describing the scene, which may or may not include the obstacles. Both solutions partially solved the safe navigation but were found to be ineffective in providing meaningful feedback and issues with the diurnal cycle. To address such limitations, we introduce the notion of free-path and threat level imposed by the static or dynamic obstacles. This solution reduced the overhead of obstacle detection and helped in designing meaningful feedback. Affording users a natural conversation through an interactive dialog enabled interface was found to promote safer navigation. In this dissertation, we modeled the free-path and threat level using a reinforcement learning (RL) framework.We built the RL model in the Gazebo robot simulation environment and implanted that in a handheld device. A natural conversation model was created using data collected through a Wizard of OZ approach. The RL model and conversational agent model together resulted in the handheld assistive device called Augmented Guiding Torch (AGT). The AGT provides improved mobility over white cane by providing ambient awareness through natural conversation. It can inform the visually impaired about the obstacles which are helpful to be warned about ahead of time, e.g., construction site, scooter, crowd, car, bike, or big hole. Using the RL framework, the robot avoided over 95% obstacles. The visually impaired avoided over 85% obstacles with the help of AGT on a 500 feet U-shape sidewalk. Findings of this dissertation support the effectiveness of augmented guiding through RL for navigation and obstacle avoidance of visually impaired users

University of Memphis Digital Commons

A Doubly-Fed Induction Generator (DFIG)-Based Wind-Power System with Integrated Energy Storage for Remote Electrification

Author: Bhuiyan Faruk Ahmed
Publication venue: Scholarship@Western
Publication date: 01/01/2009
Field of study

Electrification of off-grid remote communities is commonly accomplished through diesel generators. The method may even be employed in cases where there exists an un reliable connection to the power grid. Regardless, the method is environmentally-hostile, typically costly, and likely risky. Therefore, to mitigate the reliance on diesel fuel, uti lization of renewable energy resources has been considered in recent years. This thesis investigates the feasibility of and technical considerations involved in the employment of a specific class of variable-speed wind-power systems, integrated with battery energy stor age, for remote electrification applications. The wind-power system under consideration is based on the doubly-fed induction gen erator (DFIG) technology, which features a number of characteristics that render it at tractive for the incorporation of battery energy storage. This thesis identifies the control strategy, different control sub-functions, and the controllers structures/parametes required to accommodate the battery energy storage. The developed control strategy enables the operation of the wind-power/storage system in the off-grid (islanded) mode of operation, as well as the grid-connected mode of operation. Under the developed control strategy, the wind-power/storage system can operate in parallel with constant-speed wind-power units, passive loads, and induction motor loads. The effectiveness of the proposed control strategy has been demonstrated through comprehensive simulation studies enabled by the commercial software package PSCAD/EMTDC. In addition to the control aspects, this thesis studies the reliability aspects of the pro posed wind-power/storage system, for an example remote electrification system. Thus, a new reliability assessment method has been developed in this thesis, which combines the existing analytical and simulation-based probabilistic approaches. The reliability analysis conducted indicates that the battery energy storage capacity, the wind magnitude and pro file, and the load profile impose remarkable impacts on the reliability of the electrification system. It also indicates that a connection to the power grid, however unreliable, signifi cantly mitigates the need for a large battery to achieve a given degree of reliability

Scholarship@Western

Factors associated with stress among first-year undergraduate students attending an Australian university

Author: Ahmed Faruk
Lee Patricia C
Papier Keren
Pathirana Thanya I.
Publication venue: 'Verizona Publisher Limited'
Publication date: 01/01/2016
Field of study

Objective: The aim of this study was to examine the relationship between stress and various socio-demographic, health and behavioural factors among undergraduate students studying in an Australian university. Methods: A cross-sectional survey was carried out among firstyear undergraduate students studying at Griffith University. Participants were recruited from four different academic groups (N=728). The questionnaire used in this study comprised of three sections: socio-demographic information, stress scale and a food frequency questionnaire. K-means Cluster analysis was performed to identify the major dietary patterns and multinomial logistic regression analysis was used to examine the factors associated with stress. Results: Nearly 53% of the students had some degree of stress with 37.4% experiencing moderate to severe levels of stress. The factors most strongly associated with having mild or moderate/ severe stress levels included being in a relationship [OR =1.71, 95% CI (1.02-2.87) and OR=1.61, 95% CI (1.06-2.44)], studying a non-health related degree [OR=1.68, 95% CI (1.03-2.73) and OR=1.51, 95% CI (1.04-2.19)], working ≥ 21 hours per week [OR=2.12, 95% CI (1.02-4.40) and OR=2.21, 95% CI (1.32-3.67)], and engaging in an unhealthy dietary pattern [OR=2.67, 95% CI (1.25-5.72) and OR=2.76, 95% CI (1.47-5.16)]. Being a female [OR=1.84, 95% CI (1.25-2.72)], living in a shared accommodation [OR=0.52, 95% CI (0.27-0.98)], rarely exercising [OR=2.64, 95% CI (1.59-4.39)], having a body mass index (BMI) of 25 or over [OR=2.03, 95% CI (1.36-3.04)], and engaging in a dietary pattern that was low in protein, fruit and vegetables [OR=1.72, 95% CI (1.06-2.77)] were also associated with having moderate/severe stress levels. Conclusion: This study found that more than half of the undergraduate students had some levels of stress. Both mild and moderate/severe levels of stress were associated with sociodemographic characteristics, risky health behaviours and poor dietary patterns. Our findings reinforce the need to promote healthy behaviours among undergraduate university students in order to maintain good mental health.</p

Bond University Research Portal

Oxford University Research Archive

Detecting semantic anomalies

Author: Ahmed Faruk
Courville Aaron
Publication venue
Publication date: 21/11/2019
Field of study

We critically appraise the recent interest in out-of-distribution (OOD) detection and question the practical relevance of existing benchmarks. While the currently prevalent trend is to consider different datasets as OOD, we argue that out-distributions of practical interest are ones where the distinction is semantic in nature for a specified context, and that evaluative tasks should reflect this more closely. Assuming a context of object recognition, we recommend a set of benchmarks, motivated by practical applications. We make progress on these benchmarks by exploring a multi-task learning based approach, showing that auxiliary objectives for improved semantic awareness result in improved semantic anomaly detection, with accompanying generalization benefits.Comment: Preprint for AAAI '20 publicatio

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Diet and nutritional status during pregnancy

Author: Faruk Ahmed
Marilyn Tseng
Publication venue: 'Cambridge University Press (CUP)'
Publication date
Field of study

Crossref